Learning Neural Network Architectures using Backpropagation

نویسندگان

  • Suraj Srinivas
  • R. Venkatesh Babu
چکیده

Deep neural networks with millions of parameters are at the heart of many state of the art machine learning models today. However, recent works have shown that models with much smaller number of parameters can also perform just as well. In this work, we introduce the problem of architecture-learning, i.e; learning the architecture of a neural network along with weights. We start with a large neural network, and then learn which neurons to prune. To this end, we introduce a new trainable parameter called the Tri-State ReLU, which helps in pruning unnecessary neurons. We also propose a smooth regularizer which encourages the total number of neurons after elimination to be small. The resulting objective is differentiable and simple to optimize. We experimentally validate our method on both small and large networks, and show that it can learn models with considerably smaller number of parameters without affecting prediction accuracy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting Force in Single Point Incremental Forming by Using Artificial Neural Network

In this study, an artificial neural network was used to predict the minimum force required to single point incremental forming (SPIF) of thin sheets of Aluminium AA3003-O and calamine brass Cu67Zn33 alloy. Accordingly, the parameters for processing, i.e., step depth, the feed rate of the tool, spindle speed, wall angle, thickness of metal sheets and type of material were selected as input and t...

متن کامل

Semi-Supervised Learning Based Prediction of Musculoskeletal Disorder Risk

This study explores a semi-supervised classification approach using random forest as a base classifier to classify the low-back disorders (LBDs) risk associated with the industrial jobs. Semi-supervised classification approach uses unlabeled data together with the small number of labelled data to create a better classifier. The results obtained by the proposed approach are compared with those o...

متن کامل

Spectral Estimation of Printed Colors Using a Scanner, Conventional Color Filters and applying backpropagation Neural Network

Reconstruction the spectral data of color samples using conventional color devices such as a digital camera or scanner is always of interest. Nowadays, multispectral imaging has introduced a feasible method to estimate the spectral reflectance of the images utilizing more than three-channel imaging. The goal of this study is to spectrally characterize a color scanner using a set of conventional...

متن کامل

Stock Price Prediction: Kohonen versus Backpropagation

This paper describes the application of two different neural network types for stock price prediction. The prediction is carried out by Kohonen self-organizing maps and error backpropagation algorithm. Both experimental networks deal with price change intervals in contradiction to precise value prediction. The results are presented and its comparative analysis is performed in this paper, as wel...

متن کامل

Credit Assignment through Time: Alternatives to Backpropagation

Learning to recognize or predict sequences using long-term context has many applications. However, practical and theoretical problems are found in training recurrent neural networks to perform tasks in which input/output dependencies span long intervals. Starting from a mathematical analysis of the problem, we consider and compare alternative algorithms and architectures on tasks for which the ...

متن کامل

Neural Networks with Complex and Quaternion Inputs

Many neural network architectures operate only on real data and simple complex inputs. But there are applications where considerations of complex and quaternion inputs are quite desirable. Prior complex neural network models have generalized the Hopfield model, backpropagation and the perceptron learning rule to handle complex inputs. The Hopfield model for inputs and outputs falling on the uni...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016